Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 16175430 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 GiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|
LongitudAcc is highly correlated with Fuel Rate and 2 other fields | High correlation |
EngineSpeed is highly correlated with EngineAirInletPressure and 2 other fields | High correlation |
Fuel Rate is highly correlated with Engine Load and 1 other fields | High correlation |
Engine Load is highly correlated with Boost Pressure and 2 other fields | High correlation |
Boost Pressure is highly correlated with Engine Load and 2 other fields | High correlation |
EngineAirInletPressure is highly correlated with EngineSpeed and 3 other fields | High correlation |
AcceleratorPedalPos is highly correlated with EngineSpeed and 4 other fields | High correlation |
VehicleSpeed is highly correlated with EngineSpeed | High correlation |
BrakePedalPos is highly correlated with AcceleratorPedalPos | High correlation |
Fuel Rate is highly skewed (γ1 = 45.05381217) | Skewed |
Timestamp has unique values | Unique |
LongitudAcc has 3749236 (23.2%) zeros | Zeros |
EngineSpeed has 297129 (1.8%) zeros | Zeros |
Fuel Rate has 3737573 (23.1%) zeros | Zeros |
Engine Load has 3754590 (23.2%) zeros | Zeros |
Boost Pressure has 758809 (4.7%) zeros | Zeros |
AcceleratorPedalPos has 6243511 (38.6%) zeros | Zeros |
VehicleSpeed has 2234261 (13.8%) zeros | Zeros |
BrakePedalPos has 13229388 (81.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-23 15:37:28.295765 |
|---|---|
| Analysis finished | 2022-11-23 15:56:20.660629 |
| Duration | 18 minutes and 52.36 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 16175430 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.084673856 × 1010 |
| Minimum | 4.758328021 × 1010 |
|---|---|
| Maximum | 1.117813444 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 4.758328021 × 1010 |
|---|---|
| 5-th percentile | 5.127381075 × 1010 |
| Q1 | 6.267834658 × 1010 |
| median | 8.243642692 × 1010 |
| Q3 | 9.621343434 × 1010 |
| 95-th percentile | 1.080474353 × 1011 |
| Maximum | 1.117813444 × 1011 |
| Range | 6.419806414 × 1010 |
| Interquartile range (IQR) | 3.353508776 × 1010 |
Descriptive statistics
| Standard deviation | 1.837034532 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.2272243215 |
| Kurtosis | -1.111404269 |
| Mean | 8.084673856 × 1010 |
| Median Absolute Deviation (MAD) | 1.411740533 × 1010 |
| Skewness | -0.1751080103 |
| Sum | 1.30773076 × 1018 |
| Variance | 3.374695871 × 1020 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.758328021 × 1010 | 1 | < 0.1% |
| 9.106960468 × 1010 | 1 | < 0.1% |
| 9.106960663 × 1010 | 1 | < 0.1% |
| 9.106960771 × 1010 | 1 | < 0.1% |
| 9.10696086 × 1010 | 1 | < 0.1% |
| 9.106960968 × 1010 | 1 | < 0.1% |
| 9.106961068 × 1010 | 1 | < 0.1% |
| 9.106961176 × 1010 | 1 | < 0.1% |
| 9.106961262 × 1010 | 1 | < 0.1% |
| 9.10696137 × 1010 | 1 | < 0.1% |
| Other values (16175420) | 16175420 |
| Value | Count | Frequency (%) |
| 4.758328021 × 1010 | 1 | |
| 4.75832813 × 1010 | 1 | |
| 4.758328232 × 1010 | 1 | |
| 4.758328312 × 1010 | 1 | |
| 4.758328422 × 1010 | 1 | |
| 4.758328530 × 1010 | 1 | |
| 4.758328634 × 1010 | 1 | |
| 4.758328712 × 1010 | 1 | |
| 4.758328812 × 1010 | 1 | |
| 4.758328922 × 1010 | 1 |
| Value | Count | Frequency (%) |
| 1.117813444 × 1011 | 1 | |
| 1.117813437 × 1011 | 1 | |
| 1.117813424 × 1011 | 1 | |
| 1.117813413 × 1011 | 1 | |
| 1.117813406 × 1011 | 1 | |
| 1.117813394 × 1011 | 1 | |
| 1.117813384 × 1011 | 1 | |
| 1.117813373 × 1011 | 1 | |
| 1.117813366 × 1011 | 1 | |
| 1.117813354 × 1011 | 1 |
WetTankAirPressure
Real number (ℝ≥0)
| Distinct | 188 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.0795195 |
| Minimum | 0 |
|---|---|
| Maximum | 12.89365 |
| Zeros | 30574 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.2046 |
| Q1 | 10.82515 |
| median | 11.1699 |
| Q3 | 11.51465 |
| 95-th percentile | 11.8594 |
| Maximum | 12.89365 |
| Range | 12.89365 |
| Interquartile range (IQR) | 0.6895 |
Descriptive statistics
| Standard deviation | 0.9277010234 |
|---|---|
| Coefficient of variation (CV) | 0.08373116031 |
| Kurtosis | 59.62602948 |
| Mean | 11.0795195 |
| Median Absolute Deviation (MAD) | 0.34475 |
| Skewness | -6.287830511 |
| Sum | 179215992.1 |
| Variance | 0.8606291887 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.032 | 849388 | 5.3% |
| 11.3078 | 832510 | 5.1% |
| 11.10095 | 828945 | 5.1% |
| 10.82515 | 827575 | 5.1% |
| 11.4457 | 819910 | 5.1% |
| 11.37675 | 818766 | 5.1% |
| 10.96305 | 782144 | 4.8% |
| 10.7562 | 770063 | 4.8% |
| 11.23885 | 764861 | 4.7% |
| 11.65255 | 740124 | 4.6% |
| Other values (178) | 8141144 |
| Value | Count | Frequency (%) |
| 0 | 30574 | |
| 0.06895 | 463 | < 0.1% |
| 0.1379 | 424 | < 0.1% |
| 0.20685 | 390 | < 0.1% |
| 0.2758 | 260 | < 0.1% |
| 0.34475 | 323 | < 0.1% |
| 0.4137 | 317 | < 0.1% |
| 0.48265 | 380 | < 0.1% |
| 0.5516 | 310 | < 0.1% |
| 0.62055 | 265 | < 0.1% |
| Value | Count | Frequency (%) |
| 12.89365 | 1 | < 0.1% |
| 12.8247 | 2 | < 0.1% |
| 12.75575 | 2 | < 0.1% |
| 12.6868 | 12 | < 0.1% |
| 12.61785 | 11 | < 0.1% |
| 12.5489 | 34 | < 0.1% |
| 12.47995 | 96 | < 0.1% |
| 12.411 | 179 | < 0.1% |
| 12.34205 | 551 | < 0.1% |
| 12.2731 | 1561 |
| Distinct | 125 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.03213855211 |
| Minimum | -7.1 |
|---|---|
| Maximum | 13 |
| Zeros | 3749236 |
| Zeros (%) | 23.2% |
| Negative | 6713262 |
| Negative (%) | 41.5% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | -7.1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -0.2 |
| median | 0 |
| Q3 | 0.2 |
| 95-th percentile | 0.8 |
| Maximum | 13 |
| Range | 20.1 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.5870923095 |
|---|---|
| Coefficient of variation (CV) | -18.26754072 |
| Kurtosis | 131.3431163 |
| Mean | -0.03213855211 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 5.553683143 |
| Sum | -519854.9 |
| Variance | 0.3446773799 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3749236 | |
| -0.1 | 1601736 | |
| 0.1 | 1365021 | 8.4% |
| -0.2 | 1360163 | 8.4% |
| 0.2 | 1028054 | 6.4% |
| -0.3 | 969350 | 6.0% |
| 0.3 | 771181 | 4.8% |
| -0.4 | 656712 | 4.1% |
| 0.4 | 564522 | 3.5% |
| 0.5 | 453809 | 2.8% |
| Other values (115) | 3655646 |
| Value | Count | Frequency (%) |
| -7.1 | 1 | < 0.1% |
| -7 | 2 | < 0.1% |
| -6.6 | 1 | < 0.1% |
| -6.4 | 2 | < 0.1% |
| -6.3 | 1 | < 0.1% |
| -6.2 | 1 | < 0.1% |
| -6.1 | 1 | < 0.1% |
| -5.8 | 5 | |
| -5.6 | 4 | |
| -5.4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 8703 | |
| 12.9 | 10 | < 0.1% |
| 6.7 | 1 | < 0.1% |
| 6 | 3 | < 0.1% |
| 5.7 | 2 | < 0.1% |
| 5.6 | 2 | < 0.1% |
| 5.5 | 6 | < 0.1% |
| 5.4 | 8 | < 0.1% |
| 5.3 | 9 | < 0.1% |
| 5.2 | 18 | < 0.1% |
| Distinct | 11881 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1076.56738 |
| Minimum | 0 |
|---|---|
| Maximum | 8191.875 |
| Zeros | 297129 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 590 |
| Q1 | 906.125 |
| median | 1163.75 |
| Q3 | 1291.125 |
| 95-th percentile | 1463.875 |
| Maximum | 8191.875 |
| Range | 8191.875 |
| Interquartile range (IQR) | 385 |
Descriptive statistics
| Standard deviation | 322.2456133 |
|---|---|
| Coefficient of variation (CV) | 0.2993269341 |
| Kurtosis | 3.630577238 |
| Mean | 1076.56738 |
| Median Absolute Deviation (MAD) | 156.625 |
| Skewness | -0.785907382 |
| Sum | 1.74139403 × 1010 |
| Variance | 103842.2353 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 297129 | 1.8% |
| 600.25 | 26175 | 0.2% |
| 600.5 | 26109 | 0.2% |
| 600 | 25962 | 0.2% |
| 599.75 | 25870 | 0.2% |
| 599.5 | 25641 | 0.2% |
| 599.25 | 25594 | 0.2% |
| 600.875 | 25414 | 0.2% |
| 601.125 | 25208 | 0.2% |
| 601.375 | 24669 | 0.2% |
| Other values (11871) | 15647659 |
| Value | Count | Frequency (%) |
| 0 | 297129 | |
| 15.375 | 1 | < 0.1% |
| 16.125 | 2 | < 0.1% |
| 16.75 | 1 | < 0.1% |
| 17.125 | 1 | < 0.1% |
| 17.25 | 1 | < 0.1% |
| 17.375 | 2 | < 0.1% |
| 17.625 | 6 | < 0.1% |
| 17.75 | 8 | < 0.1% |
| 17.875 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 8191.875 | 194 | |
| 2250.25 | 1 | < 0.1% |
| 2176.25 | 1 | < 0.1% |
| 2158.875 | 1 | < 0.1% |
| 2149.625 | 1 | < 0.1% |
| 2138.625 | 1 | < 0.1% |
| 2131.25 | 1 | < 0.1% |
| 2129.875 | 1 | < 0.1% |
| 2128.5 | 1 | < 0.1% |
| 2128.125 | 1 | < 0.1% |
| Distinct | 1146 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.86278092 |
| Minimum | 0 |
|---|---|
| Maximum | 3876.198645 |
| Zeros | 3737573 |
| Zeros (%) | 23.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.301234 |
| median | 8.28058 |
| Q3 | 22.239272 |
| 95-th percentile | 48.086511 |
| Maximum | 3876.198645 |
| Range | 3876.198645 |
| Interquartile range (IQR) | 20.938038 |
Descriptive statistics
| Standard deviation | 82.57428514 |
|---|---|
| Coefficient of variation (CV) | 5.205536506 |
| Kurtosis | 2101.817277 |
| Mean | 15.86278092 |
| Median Absolute Deviation (MAD) | 8.28058 |
| Skewness | 45.05381217 |
| Sum | 256587302.3 |
| Variance | 6818.512566 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3737573 | 23.1% |
| 3.489673 | 103654 | 0.6% |
| 3.430526 | 103116 | 0.6% |
| 3.54882 | 102085 | 0.6% |
| 3.607967 | 98549 | 0.6% |
| 3.667114 | 95051 | 0.6% |
| 3.371379 | 95051 | 0.6% |
| 3.726261 | 94689 | 0.6% |
| 3.785408 | 94479 | 0.6% |
| 3.844555 | 94012 | 0.6% |
| Other values (1136) | 11557171 |
| Value | Count | Frequency (%) |
| 0 | 3737573 | |
| 0.059147 | 15315 | 0.1% |
| 0.118294 | 15058 | 0.1% |
| 0.177441 | 18192 | 0.1% |
| 0.236588 | 23390 | 0.1% |
| 0.295735 | 21374 | 0.1% |
| 0.354882 | 19379 | 0.1% |
| 0.414029 | 18144 | 0.1% |
| 0.473176 | 15722 | 0.1% |
| 0.532323 | 13350 | 0.1% |
| Value | Count | Frequency (%) |
| 3876.198645 | 7121 | |
| 2773.225389 | 1 | < 0.1% |
| 2707.867954 | 1 | < 0.1% |
| 2670.309609 | 1 | < 0.1% |
| 2660.668648 | 1 | < 0.1% |
| 2619.857218 | 1 | < 0.1% |
| 2551.187551 | 1 | < 0.1% |
| 2551.069257 | 1 | < 0.1% |
| 2525.340312 | 1 | < 0.1% |
| 2523.21102 | 1 | < 0.1% |
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.14421947 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 3754590 |
| Zeros (%) | 23.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.5 |
| median | 25.5 |
| Q3 | 46.5 |
| 95-th percentile | 92.5 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 28.21204881 |
|---|---|
| Coefficient of variation (CV) | 0.9058518495 |
| Kurtosis | -0.1126133869 |
| Mean | 31.14421947 |
| Median Absolute Deviation (MAD) | 21.5 |
| Skewness | 0.8287591997 |
| Sum | 503771142 |
| Variance | 795.9196981 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3754590 | 23.2% |
| 100 | 539465 | 3.3% |
| 20.5 | 195811 | 1.2% |
| 22.5 | 195537 | 1.2% |
| 22 | 192780 | 1.2% |
| 21.5 | 192211 | 1.2% |
| 21 | 192132 | 1.2% |
| 23 | 190179 | 1.2% |
| 20 | 189390 | 1.2% |
| 23.5 | 183555 | 1.1% |
| Other values (191) | 10349780 |
| Value | Count | Frequency (%) |
| 0 | 3754590 | |
| 0.5 | 66062 | 0.4% |
| 1 | 51293 | 0.3% |
| 1.5 | 39864 | 0.2% |
| 2 | 37357 | 0.2% |
| 2.5 | 33718 | 0.2% |
| 3 | 35533 | 0.2% |
| 3.5 | 33649 | 0.2% |
| 4 | 37809 | 0.2% |
| 4.5 | 34726 | 0.2% |
| Value | Count | Frequency (%) |
| 100 | 539465 | |
| 99.5 | 15646 | 0.1% |
| 99 | 16937 | 0.1% |
| 98.5 | 19239 | 0.1% |
| 98 | 18438 | 0.1% |
| 97.5 | 18209 | 0.1% |
| 97 | 18034 | 0.1% |
| 96.5 | 17829 | 0.1% |
| 96 | 18112 | 0.1% |
| 95.5 | 17112 | 0.1% |
| Distinct | 190 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2407906249 |
| Minimum | 0 |
|---|---|
| Maximum | 1.628802 |
| Zeros | 758809 |
| Zeros (%) | 4.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.008618 |
| Q1 | 0.060326 |
| median | 0.12927 |
| Q3 | 0.336102 |
| 95-th percentile | 0.844564 |
| Maximum | 1.628802 |
| Range | 1.628802 |
| Interquartile range (IQR) | 0.275776 |
Descriptive statistics
| Standard deviation | 0.2753647173 |
|---|---|
| Coefficient of variation (CV) | 1.143585708 |
| Kurtosis | 3.631207411 |
| Mean | 0.2407906249 |
| Median Absolute Deviation (MAD) | 0.112034 |
| Skewness | 1.886434661 |
| Sum | 3894891.897 |
| Variance | 0.07582572755 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.008618 | 953943 | 5.9% |
| 0.017236 | 923879 | 5.7% |
| 0 | 758809 | 4.7% |
| 0.094798 | 630800 | 3.9% |
| 0.103416 | 605803 | 3.7% |
| 0.08618 | 582021 | 3.6% |
| 0.112034 | 533571 | 3.3% |
| 0.077562 | 468259 | 2.9% |
| 0.120652 | 442572 | 2.7% |
| 0.025854 | 409955 | 2.5% |
| Other values (180) | 9865818 |
| Value | Count | Frequency (%) |
| 0 | 758809 | |
| 0.008618 | 953943 | |
| 0.017236 | 923879 | |
| 0.025854 | 409955 | |
| 0.034472 | 331147 | 2.0% |
| 0.04309 | 285919 | 1.8% |
| 0.051708 | 274049 | 1.7% |
| 0.060326 | 294336 | 1.8% |
| 0.068944 | 358841 | 2.2% |
| 0.077562 | 468259 |
| Value | Count | Frequency (%) |
| 1.628802 | 4 | < 0.1% |
| 1.620184 | 7 | < 0.1% |
| 1.611566 | 9 | < 0.1% |
| 1.602948 | 8 | < 0.1% |
| 1.59433 | 8 | < 0.1% |
| 1.585712 | 10 | < 0.1% |
| 1.577094 | 36 | < 0.1% |
| 1.568476 | 48 | < 0.1% |
| 1.559858 | 108 | |
| 1.55124 | 131 |
| Distinct | 95 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125.1348283 |
| Minimum | 50 |
|---|---|
| Maximum | 510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 106 |
| median | 114 |
| Q3 | 134 |
| 95-th percentile | 186 |
| Maximum | 510 |
| Range | 460 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 27.57660281 |
|---|---|
| Coefficient of variation (CV) | 0.22037512 |
| Kurtosis | 4.068619548 |
| Mean | 125.1348283 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.906545805 |
| Sum | 2024109656 |
| Variance | 760.4690224 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 102 | 1604595 | 9.9% |
| 110 | 1308048 | 8.1% |
| 112 | 1259388 | 7.8% |
| 104 | 1127369 | 7.0% |
| 108 | 900214 | 5.6% |
| 114 | 854384 | 5.3% |
| 106 | 712282 | 4.4% |
| 116 | 654743 | 4.0% |
| 100 | 520470 | 3.2% |
| 118 | 496468 | 3.1% |
| Other values (85) | 6737469 |
| Value | Count | Frequency (%) |
| 50 | 2 | < 0.1% |
| 52 | 1 | < 0.1% |
| 68 | 5 | < 0.1% |
| 82 | 1 | < 0.1% |
| 84 | 26 | < 0.1% |
| 86 | 21 | < 0.1% |
| 88 | 1 | < 0.1% |
| 94 | 535 | < 0.1% |
| 96 | 8036 | < 0.1% |
| 98 | 81245 |
| Value | Count | Frequency (%) |
| 510 | 202 | < 0.1% |
| 508 | 4 | < 0.1% |
| 264 | 11 | < 0.1% |
| 262 | 22 | < 0.1% |
| 260 | 39 | < 0.1% |
| 258 | 157 | < 0.1% |
| 256 | 390 | < 0.1% |
| 254 | 790 | < 0.1% |
| 252 | 1565 | |
| 250 | 2633 |
| Distinct | 251 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.37506754 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 6243511 |
| Zeros (%) | 38.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 42.4 |
| Q3 | 68 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 68 |
Descriptive statistics
| Standard deviation | 35.39829833 |
|---|---|
| Coefficient of variation (CV) | 0.9224296032 |
| Kurtosis | -1.422346462 |
| Mean | 38.37506754 |
| Median Absolute Deviation (MAD) | 42.4 |
| Skewness | 0.1983658573 |
| Sum | 620733218.8 |
| Variance | 1253.039524 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 6243511 | |
| 100 | 927747 | 5.7% |
| 64.8 | 78073 | 0.5% |
| 65.6 | 77452 | 0.5% |
| 64.4 | 76921 | 0.5% |
| 62.4 | 76880 | 0.5% |
| 66.4 | 76496 | 0.5% |
| 61.2 | 76029 | 0.5% |
| 67.2 | 75591 | 0.5% |
| 60.8 | 75578 | 0.5% |
| Other values (241) | 8391152 |
| Value | Count | Frequency (%) |
| 0 | 6243511 | |
| 0.4 | 6397 | < 0.1% |
| 0.8 | 6752 | < 0.1% |
| 1.2 | 6583 | < 0.1% |
| 1.6 | 6866 | < 0.1% |
| 2 | 6457 | < 0.1% |
| 2.4 | 6722 | < 0.1% |
| 2.8 | 6948 | < 0.1% |
| 3.2 | 6586 | < 0.1% |
| 3.6 | 6996 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 927747 | |
| 99.6 | 19014 | 0.1% |
| 99.2 | 19674 | 0.1% |
| 98.8 | 18787 | 0.1% |
| 98.4 | 19574 | 0.1% |
| 98 | 19968 | 0.1% |
| 97.6 | 20092 | 0.1% |
| 97.2 | 20498 | 0.1% |
| 96.8 | 19735 | 0.1% |
| 96.4 | 20739 | 0.1% |
| Distinct | 1044 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.12837527 |
| Minimum | 0 |
|---|---|
| Maximum | 255.97971 |
| Zeros | 2234261 |
| Zeros (%) | 13.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 16.498944 |
| median | 38.794392 |
| Q3 | 56.394828 |
| 95-th percentile | 75.592818 |
| Maximum | 255.97971 |
| Range | 255.97971 |
| Interquartile range (IQR) | 39.895884 |
Descriptive statistics
| Standard deviation | 24.6526816 |
|---|---|
| Coefficient of variation (CV) | 0.6639849286 |
| Kurtosis | -0.4503887306 |
| Mean | 37.12837527 |
| Median Absolute Deviation (MAD) | 19.90107 |
| Skewness | 0.06865688 |
| Sum | 600567435.2 |
| Variance | 607.7547103 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2234261 | 13.8% |
| 48.996864 | 29100 | 0.2% |
| 48.496896 | 27740 | 0.2% |
| 48.196134 | 27726 | 0.2% |
| 47.895372 | 27399 | 0.2% |
| 47.696166 | 27291 | 0.2% |
| 46.293912 | 27006 | 0.2% |
| 46.895436 | 26987 | 0.2% |
| 46.996992 | 26987 | 0.2% |
| 47.49696 | 26803 | 0.2% |
| Other values (1034) | 13694130 |
| Value | Count | Frequency (%) |
| 0 | 2234261 | |
| 0.999936 | 3303 | < 0.1% |
| 1.097586 | 3967 | < 0.1% |
| 1.199142 | 4689 | < 0.1% |
| 1.296792 | 5434 | < 0.1% |
| 1.398348 | 5577 | < 0.1% |
| 1.499904 | 5889 | < 0.1% |
| 1.597554 | 8135 | 0.1% |
| 1.69911 | 6405 | < 0.1% |
| 1.79676 | 6765 | < 0.1% |
| Value | Count | Frequency (%) |
| 255.97971 | 194 | < 0.1% |
| 255.975804 | 1479 | |
| 134.19063 | 1 | < 0.1% |
| 125.491968 | 1 | < 0.1% |
| 105.290136 | 1 | < 0.1% |
| 104.99328 | 1 | < 0.1% |
| 104.891724 | 4 | < 0.1% |
| 104.790168 | 2 | < 0.1% |
| 104.692518 | 2 | < 0.1% |
| 104.590962 | 1 | < 0.1% |
| Distinct | 240 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.874096874 |
| Minimum | 0 |
|---|---|
| Maximum | 96 |
| Zeros | 13229388 |
| Zeros (%) | 81.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 123.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 19.2 |
| Maximum | 96 |
| Range | 96 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 6.789764373 |
|---|---|
| Coefficient of variation (CV) | 2.362399275 |
| Kurtosis | 6.413069056 |
| Mean | 2.874096874 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.43875188 |
| Sum | 46489752.8 |
| Variance | 46.10090024 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 13229388 | |
| 16 | 166268 | 1.0% |
| 14.4 | 158445 | 1.0% |
| 14.8 | 137120 | 0.8% |
| 15.6 | 130755 | 0.8% |
| 15.2 | 125022 | 0.8% |
| 14 | 108403 | 0.7% |
| 16.4 | 97442 | 0.6% |
| 13.6 | 95242 | 0.6% |
| 13.2 | 56159 | 0.3% |
| Other values (230) | 1871186 | 11.6% |
| Value | Count | Frequency (%) |
| 0 | 13229388 | |
| 0.4 | 48082 | 0.3% |
| 0.8 | 31742 | 0.2% |
| 1.2 | 19269 | 0.1% |
| 1.6 | 14231 | 0.1% |
| 2 | 14547 | 0.1% |
| 2.4 | 13760 | 0.1% |
| 2.8 | 14155 | 0.1% |
| 3.2 | 15200 | 0.1% |
| 3.6 | 12828 | 0.1% |
| Value | Count | Frequency (%) |
| 96 | 155 | |
| 95.6 | 196 | |
| 95.2 | 3 | < 0.1% |
| 94.8 | 4 | < 0.1% |
| 94.4 | 2 | < 0.1% |
| 94 | 11 | < 0.1% |
| 93.6 | 8 | < 0.1% |
| 93.2 | 6 | < 0.1% |
| 92.8 | 2 | < 0.1% |
| 92.4 | 3 | < 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4.758328e+10 | 11.03200 | 0.7 | 582.625 | 8.517168 | 49.5 | 0.017236 | 102.0 | 40.4 | 0.000000 | 0.0 |
| 1 | 4.758328e+10 | 11.03200 | 1.0 | 664.500 | 16.442866 | 75.5 | 0.043090 | 104.0 | 66.0 | 4.999680 | 0.0 |
| 2 | 4.758328e+10 | 11.03200 | 1.1 | 1123.875 | 30.283264 | 66.0 | 0.112034 | 108.0 | 82.0 | 9.097074 | 0.0 |
| 3 | 4.758328e+10 | 11.03200 | 1.3 | 1656.500 | 45.661484 | 80.0 | 0.232686 | 126.0 | 94.0 | 13.999104 | 0.0 |
| 4 | 4.758328e+10 | 11.03200 | -0.2 | 1767.375 | 15.851396 | 0.0 | 0.491226 | 152.0 | 100.0 | 16.596594 | 0.0 |
| 5 | 4.758329e+10 | 10.96305 | -0.3 | 930.125 | 9.818402 | 20.0 | 0.456754 | 150.0 | 100.0 | 16.096626 | 0.0 |
| 6 | 4.758329e+10 | 10.96305 | 1.1 | 1055.000 | 32.885732 | 84.0 | 0.284394 | 146.0 | 100.0 | 18.596466 | 0.0 |
| 7 | 4.758329e+10 | 10.96305 | 1.0 | 1304.625 | 44.182809 | 90.0 | 0.491226 | 174.0 | 100.0 | 22.896972 | 0.0 |
| 8 | 4.758329e+10 | 10.96305 | 1.3 | 1507.375 | 57.136002 | 100.0 | 0.758384 | 206.0 | 100.0 | 27.095922 | 0.0 |
| 9 | 4.758329e+10 | 10.96305 | 0.2 | 1687.500 | 33.713790 | 45.5 | 1.060014 | 176.0 | 100.0 | 30.896460 | 0.0 |
Last rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 16175420 | 1.117813e+11 | 11.79045 | -1.0 | 661.375 | 3.785408 | 17.0 | 0.094798 | 108.0 | 0.0 | 4.398156 | 18.4 |
| 16175421 | 1.117813e+11 | 11.79045 | 0.0 | 613.625 | 5.204936 | 31.5 | 0.051708 | 106.0 | 0.0 | 2.199078 | 4.0 |
| 16175422 | 1.117813e+11 | 11.72150 | -1.4 | 599.625 | 4.258584 | 25.0 | 0.034472 | 106.0 | 0.0 | 0.000000 | 14.8 |
| 16175423 | 1.117813e+11 | 11.72150 | 0.0 | 602.250 | 4.081143 | 23.5 | 0.025854 | 104.0 | 0.0 | 0.000000 | 25.2 |
| 16175424 | 1.117813e+11 | 11.65255 | 0.0 | 599.375 | 4.081143 | 23.5 | 0.017236 | 104.0 | 0.0 | 0.000000 | 31.2 |
| 16175425 | 1.117813e+11 | 11.58360 | 0.0 | 605.000 | 4.613466 | 24.0 | 0.017236 | 104.0 | 0.0 | 0.000000 | 30.4 |
| 16175426 | 1.117813e+11 | 11.58360 | 0.0 | 605.125 | 3.312232 | 19.0 | 0.017236 | 104.0 | 0.0 | 0.000000 | 10.4 |
| 16175427 | 1.117813e+11 | 11.58360 | 0.0 | 605.000 | 4.140290 | 25.0 | 0.017236 | 104.0 | 0.0 | 0.000000 | 0.0 |
| 16175428 | 1.117813e+11 | 11.58360 | 0.0 | 604.500 | 3.430526 | 19.5 | 0.017236 | 104.0 | 0.0 | 0.000000 | 0.0 |
| 16175429 | 1.117813e+11 | 11.51465 | 0.0 | 602.375 | 3.371379 | 19.5 | 0.008618 | 104.0 | 0.0 | 0.000000 | 0.0 |